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DuyguAKgAYt [BS], ABSTRACT 

Qetin KOCAEFE^*, [MD], [PhD] The discovery of the structure of DNA and, consequently, of gene sequences 

accelerated efforts towards the implementation of genome technologies for 
the advancement of human health. Despite remarkable progress in biotech¬ 
nology, human genetic diseases are still untreatable at present. Advancement 
in gene transfer technologies in the 90s initiated the first gene replacement 
therapy trials aiming at the correction of selected congenital genetic defects. 
By the beginning of the new millennium, progress in viral vectors primed the 
first successful clinical gene therapy initiatives. Today, biomedicine is on the 
edge of a new era with the implementation of genome editing technologies. 
The introduction of these novel techniques, gene correction therapy, which 
was no more than a fantasy in the last century, is now undergoing clinical 
trials. The key milestones of the nucleic acid technologies that laid the fun¬ 
damentals of gene transfer and gene replacement therapy are summarized 
in this review with special emphasis on the three cardinal genome editing 
technologies: the Zinc-Finger Nucleases, TALE Nucleases and most recent¬ 
ly the CRISPS/Cas system that made genome engineering possible. These 
novel applications recently initiated a wide range of uses, such as the gener¬ 
ation of engineered cell lines, isogenetic model organisms and human clini¬ 
cal trials, towards gene correction aiming for the treatment of cancer, infec¬ 
tious diseases and single gene disorders. The current results and status of 
these novel approaches show great promise for development into therapeu- 

, ,, . . „ , tic applications for the advancement of human health. This review is primar- 
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S eventy years ago, Avery, MacLeod and McCarty 
showed that through the transfer of DNA, unen¬ 
capsulated “Rough” variants of Pneumococcus Type 
II could be converted into fully encapsulated cells 
[1]. Even though this ground-breaking study initiat¬ 
ed the era of gene transfer and microbial genetics, it 
was initially undervalued by the scientific communi¬ 
ty. Following the unveiling of the structure of DNA, 
the genetic code and the concept structure of “the 
gene” were identified [2]. The next challenge was to 
understand the function of the genes and DNA se¬ 
quences. Thus, the initial aim of the introduction of 
various DNA fragments into organisms was explo¬ 
ration of DNA function. These efforts led to the new 
era of recombinant DNA technology, where cutting 
and joining various DNA fragments could be used 


for the production of recombinant genes and pro¬ 
teins. This motive led researchers to seek various 
gene transfer approaches to target cells and organ¬ 
isms and fit various needs. Among these, biotech¬ 
nology, the science of producing protein products, 
advanced much faster than other applications in bio¬ 
medicine. The major challenge was to combine this 
new technology with efficient gene transfer tech¬ 
niques to express protein products in various ways. 
Among these, the cloning of the human growth hor¬ 
mone and insulin pioneered the expansion in bio¬ 
technology in the early 80s. The development of 
methodologies to produce recombinant proteins 
and enzymes for the advancement of human health 
represents key achievements in the history of med¬ 
icine. Today, biotechnology for the advancement of 
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human health is capable of offering pure antigens for 
vaccination, human insulin molecule for the Type I 
diabetic patients as well as recombinant enzymes for 
lysosomal storage disease patients. While biotech¬ 
nology applications are beyond the scope of this re¬ 
view, the number of recombinant protein products 
on the market as well as their indications increase 
each year. However, no FDA-approved gene therapy 
is currently available. 

Initial efforts to identify disease-causing mu¬ 
tations and related genes date back to before 1980. 
At that time, this identification could only be 
achieved by using biochemical analysis of the pro¬ 
tein products. Identification of the hemophilia A, 
Hypoxanthine-guanine phosphoribosyltransferase 
(HPRT) and Glucose-6-phosphate dehydrogenase 
(G6PD) genes were the initial successes of forward 
genetics [3]. The identification of the landmarks of 
the human genome and publication of the first hu¬ 
man genetic linkage map accelerated the positional 
cloning of the genes [4, 5]. The completion of the hu¬ 
man genome project in 2004 aided understanding of 
the gene function and establishing associations be¬ 
tween mutations and disease mechanisms [6]. 

The discovery of the human genome sequence 
created anticipation for the implementation of this 
information on human health. However, crude DNA 
sequence information was just the end of the be¬ 
ginning of the struggle against genetic disorders. 
Gurrently, three fundamental therapy approaches 
exist in dealing with a disease caused by gene muta¬ 
tions. The first is known as “gene replacement ther¬ 
apy”. Here, a “new” (i.e. artificial) DNA sequence is 
integrated into the host genome to compensate for 
the loss of the mutated protein product. This may 
be either achieved directly in vivo, or through re-ad- 
ministration of the genetically modified “autolo¬ 
gous” cells known as ex vivo gene delivery. The clas¬ 
sical clinical application of this approach was the 
much-debated “cure” for the X-linked severe com¬ 
bined immune deficiency (SCID-Xl); the first gene 
therapy trial, in which eight boys had been cured 
[7]. The second approach is the modulation of the 
mutated gene structure to achieve a “partially func¬ 
tional” protein product. Skipping of the exon (s) that 
contains a mutation can be beneficial especially if 
the skipped exon would not disrupt the open read¬ 
ing frame. “Exon-skipping” is the alteration of the 
splicing mechanism to “skip” the mutated exons in 
order to transcribe a shorter mRNA with signifi¬ 
cantly higher function. Several obstacles and issues 


complicate putative gene replacement therapy for 
the muscle tissue, such as the size of the gene, suc¬ 
cessful targeting and delivery into muscle. Exon¬ 
skipping is one promising gene therapy approach 
for Duchenne’s muscular dystrophy. Gurrently clin¬ 
ical trials using exon-skipping approach are ongoing 
[8]. Lastly, the “ultimate treatment” of a genetic dis¬ 
ease is gene correction therapy. This approach can 
be briefly summarized as the direct in situ correc¬ 
tion of the mutated gene. This process was consid¬ 
ered to be no more than a fantasy in the last centu¬ 
ry. However, with the implementation of new tools, 
the promising approach of gene correction therapy 
will soon initiate a new age for the treatment of ge¬ 
netic disorders. 

Gene Transfer 

Initial efforts to transfer DNA into mammalian cells 
were an adaptation of prokaryotic transformation 
approaches. The priming protocols commonly pre¬ 
cipitated DNA onto the in vitro cultivated mamma¬ 
lian cells. These protocols need to be precisely op¬ 
timized for the cell type and were unsuitable for in 
vivo applications due to the lack of any options for 
targeting or tropism. Galcium phosphate precipita¬ 
tion (as exemplified in the next section) or formu¬ 
lation of DNA complexes with cationic compounds 
such as poly-ethylene imin (PEI) were partially re¬ 
placed by liposomal structures over time. These 
non-selective chemical gene transfer approaches 
are still in practice today for research applications. 
In the past, a limited number of clinical gene ther¬ 
apy trials were also initiated using these direct gene 
transfer vectors [9]. Apart from the above-men¬ 
tioned chemical complexes, there are also physi¬ 
cal methods of gene transfer. Electroporation is one 
particular physical gene transfer technique in which 
an electric field is applied to drive DNA into select¬ 
ed cell types. This approach also has a special in vivo 
application niche in skeletal muscle, which is physi¬ 
cally more resistant to such stress [10]. 

Following the introduction of viruses as efficient 
tools for gene transfer, the golden age of viral vec¬ 
tors was initiated. Adenoviruses, adeno-associat- 
ed viruses, herpes simplex viruses and retroviruses 
were extensively studied with the goal of gene deliv¬ 
ery into human cells. The first attempted gene re¬ 
placement therapy using viral vectors was the debat¬ 
ed trial of Martine Cline in 1990. His goal was to ac¬ 
complish ex vivo gene replacement therapy to cor¬ 
rect beta thalassemia using herpes simplex vectors 
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harboring the beta globin gene. Much controversy 
was created due to the fact that he did not obtain 
any institutional approval for this procedure to be 
applied on two patients. Despite the fact that none 
were harmed, neither was any success achieved [11]. 
Until the domestication of the lentiviruses, the ma¬ 
jority of the past gene therapy approaches and clin¬ 
ical trials focused on Maloney Murine Leukemia 
Virus as a retroviral vector. The major breakthrough 
of domestication of the human immunodefiency vi¬ 
rus (HIV) initiated the implementation of lentivirus¬ 
es as stable gene therapy vectors [12]. 

A Brief History of Gene Replacement 
Therapy 

The first gene transfer into mammalian cells was 
in 1962, in which Szybalska and Szybalski showed 
that mutant HPRT (-) bone marrow cell lines could 
be biochemically converted into an HPRT (+) state 
by calcium phosphate-mediated DNA transfer [13]. 
The authors not only described the methodology for 
in vitro gene transfer, but also laid the fundamentals 
of gene replacement therapy for the Lesch-Nyhan 
syndrome. In this rare genetic disorder, patients 
carry mutations in their HPRT gene [14]. Unlike 
the prokaryotic cells, even under optimal condi¬ 
tions, transfection efficiency of DNA into mamma¬ 
lian cells is very low. Besides several previous exam¬ 
ples in the bacteria, Temin demonstrated that the 
Rous sarcoma virus was capable of introducing vi¬ 
rus-specific genes into chicken cells and could in¬ 
duce a clear-cut phenotypic effect [15]. This pro¬ 
vided proof that viruses could be used as tools to 
transfer genes into eukaryotic cells. The first in vi¬ 
tro demonstration of virus-mediated gene transfer 
was in 1973. Researchers inoculated the Shope virus 
in the tissue cultures of patient fibroblasts to restore 
arginase enzyme deficiency [16]. However, a clinical 
trial on hyperargininemic patients in which the vi¬ 
rus was injected intravenously proved to be ineffec¬ 
tive [17]. The first FDA-approved clinical trial was 
initiated in 1990 by William French Anderson to 
treat a genetic defect causing adenosine deaminase 
deficiency. Two children received ex vivo modified 
autologous white blood cells expressing the adenos¬ 
ine deaminase gene. Ashanti DeSilva, renowned pa¬ 
tient and recipient of this approach, initially benefit- 
ted out of this trial but the impact proved to be tem¬ 
porary [18]. Naked DNA (plasmid) was also consid¬ 
ered for the role of a gene transfer vector targeting 
skeletal muscle in myopathy patients. Plasmid DNA 


coding the dystrophin gene was delivered into the 
muscles of nine myopathy patients, but no beneficial 
effect was observed [19]. As mentioned above, the 
first sustainable successful clinical trial for a genet¬ 
ic disease was the gene therapy approach targeting 
the SCID-Xl gene. However, four of the nine boys 
who had enrolled in the trial developed acute lym¬ 
phoblastic leukemia secondary to the insertional ac¬ 
tivation of a proto-oncogene, and one of them died 
[7]. Unfortunately, this occurred just after the trag¬ 
ic death of 18-year-old patient Jesse Gelsinger at the 
University of Pennsylvania in Philadelphia during 
another gene therapy trial [20]. These two incidents 
created a major setback for all gene therapy trials 
(with the resulting cessation of all similar trials) and 
also placed a black label on viral vectors. 

Currently, the number of on-going gene therapy 
clinical trials exceeds 2000. Approximately 65% of 
these are cancer gene therapy trials. This is followed 
by the monogenic (9%), infectious (8%) and cardio¬ 
vascular diseases (7%) [21]. About 65% of these tri¬ 
als are conducted in the U.S.A., followed by 10% in 
the UK, 4% in Germany, 2.5% in France and 2.4% in 
Switzerland. A dedicated website holds and updates 
the relevant catalogue information regarding the 
ongoing gene therapy trials as well as the past ones 
(www.wiley.co.uk/genemed/clinical). Successful 
gene therapy trials initiated for genetic diseases in 
the last decade include Leber’s congenital amau¬ 
rosis [22], |3-thalassemia [23], ADA-SCID [24] and 
Wiskott-Aldrich syndrome [25]. Currently, Glybera® 
is the sole commercial gene therapy product on the 
market. Glybera® is a recombinant adeno-associated 
viral vector harboring lipoprotein lipase (LPL) gene 
and is approved to be used for LPL deficiency by the 
European Medicines Agency (EMA). Phase II clini¬ 
cal trial conducted in Europe proved efficient recov¬ 
ery from LPL deficiency by intramuscular injection 
of viral vectors [26]. However, FDA is still evaluating 
the approval mainly due to the immunological re¬ 
actions raised against the capsid protein of the ade¬ 
no-associated viral particles [27]. 

Gene Correction Therapy: The New Hope 
for Single Gene Disorders 

Two principal routes are commonly pursued in the 
investigation of a gene function. The first one is 
known as “overexpression,” in which the forced ex¬ 
pression of the gene of interest is achieved in target 
cells (or transgenic animals). The second approach 
is the achievement of the “loss-of-function” that 
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would serve as a model for the human genetic dis¬ 
ease caused by mutations. Both of these approach¬ 
es enable the investigation of the ways in which gen¬ 
otypes may influence the phenotype [28]. The for¬ 
mer can be modelled by direct gene transfer using 
appropriate vectors, as discussed above. Similarly, 
the transgenic animals can be generated by the sta¬ 
ble germ-line integration of the transgene. The lat¬ 
ter approach can only be accomplished by target¬ 
ed mutations in the genome. The key to the imple¬ 
mentation of targeted mutations is “homologous re¬ 
combination” (HR) [29]. Through the implementa¬ 
tion of this method, targeted sequence modifica¬ 
tions as well as null mutations can be created in de¬ 
sired genes. Basically, HR is the strand exchange be¬ 
tween identical DNA sequences [30]. Thus, an arti¬ 
ficial DNA molecule (donor vector) with an identical 
sequence may trigger HR in the target locus with se¬ 
quence similarity (Figure 1). Under physiologic con¬ 
ditions, this is an extremely rare event. Even in the 
presence of highly identical sequences, mammalian 
genomic DNA is extremely stable and spontaneous 
recombination practically never occurs. In the phys¬ 
iological state, homologous recombination frequen¬ 
cy is estimated to be one in every 10^ to 10^ cells 
[28]. However, mammalian cells may use homolo¬ 
gous recombination to repair DNA damage [31]. The 


below-described genome editing tools provide an 
opportunity to facilitate this recombination event. 

The key to increasing the efficiency of homol¬ 
ogous recombination is to follow the non-physio- 
logical route. Once a double strand break occurs, 
the probability of repair by homologous recombi¬ 
nation is boosted [28]. The recent applications de¬ 
veloped in the last fifteen years, termed as “genome 
editing tools,” enable the achievement of target¬ 
ed modifications in any gene, both in vitro and in 
vivo. These approaches are based on the creation 
of a precise double-strand DNA break (DSBs) in 
a target sequence (see the glossary box in table 1). 
This is accomplished through a “nuclease” function 
that would hydrolyze the phosphodiester bonds on 
both strands of a DNA helix. As explained above, 
such double strand breaks evoke DNA damage re¬ 
sponse and trigger HR. Here, the most important 
challenge is to direct this nuclease action specifi¬ 
cally to the target DNA sequence. This sequence 
specificity can be achieved by engineered nucleas¬ 
es based on an enhanced DNA recognition proper¬ 
ty. This way, efficient and accurate genetic modifi¬ 
cations can be achieved at a specific locus. Firstly, 
DSBs are introduced at the target locus, and then 
cellular DNA repair mechanisms are provoked 
to do the repair. The choice of the repair method 
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Figure 1. Principle mechanism of homologous recombination (HR) is illustrated. Any double-strand breakage (DSB) 
initiates the DNA repair mechanisms in the cell (A). HR mediated repair is initiated in the presence of an identical 
“homologous” sequence in the cell. Genome engineering approaches require a targeted nuclease to cleave the desired 
locus (to achieve the DSB) and an artificial DNA sequence to be delivered as a donor plasmid (B). HR mediated strand 
exchange incorporates the preferred nucleotide change (designated as M) existing in the donor plasmid vector into 
the genome (C). 
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Table 1. 

Glossary box: 

Double strand breaks (DSBs): The breakage of the backbone of a DNA chain that encompasses both strands. DSBs is DNA damage that occurs following 
exposure to ionization or ultraviolet radiation. 

Isogenetic disease model: A model organism carrying the same disease causing mutation as in humans. 

Nuclease: Enzymatic activity that hydrolyses the sugar-phosphate backbone of the nucleic acids, forming a DSB. 

Restriction endonuclease: An enzyme that cleaves the sugar-phosphate backbone of the DNA by recognizing a specific sequence. 

Zinc-Finger Proteins: Transcription factor with a certain structure (containing a zinc molecule) that exhibits sequence-specific DNA-binding activity. 


defines the nature of the genome editing approach. 
Either a more error-prone “non-homologous end 
joining” (NHEJ) or HR is activated. The former, 
NHEJ, rapidly joins the broken ends. This mech¬ 
anism exclusively introduces one or more base de¬ 
letions or insertions at the site of repair. This pro¬ 
cess can be utilized to introduce loss-of-function 
mutations in desired genes. In case there is a donor 
DNA sequence available for homologous recombi¬ 
nation, recombinational repair mechanism (HR) is 
then activated. Any desired sequence variation can 
be introduced using this engineered HR approach. 
The use of the targeted nucleases with or without 
recombination vectors is called “genome editing”. 
Today, genome editing tools can be used to gener¬ 
ate targeted mutations, gene replacement and gene 
correction, both in vitro and in vivo. 

In order to gain sequence specificity to the nu¬ 
clease action, site-specific DNA-binding proteins 
are utilized. These proteins are derived from (1) 
zinc-finger proteins (ZFPs) (2) transcription acti¬ 
vator-like effector (TALE) proteins and (3) clus¬ 
tered regulatory interspaced short palindromic re¬ 
peats (CRISPR/Cas) [30]. These three novel technol¬ 
ogies are effectively used in many cell types and or¬ 
ganisms and accelerate genome engineering stud¬ 
ies. These novel tools also harbor promising clin¬ 
ical potential in gene replacement therapy. More 
than two decades have passed since FDA approved 
the first gene therapy trial, and the introduction of 
the genome editing technologies, gene therapy ap¬ 
proaches have since accelerated significantly [32]. 
While ZFNs are already being tested in clinical tri¬ 
als, TALEN and CRISPR/Cas technologies are rela¬ 
tively new and have not yet been tested in any clin¬ 
ical applications. Nevertheless, they are currently 
widely in use for a range of cell lines and model or¬ 
ganisms. The current results and the status of these 
new approaches show great promise to develop into 
therapeutic applications. 


Zinc-Finger Nucleases 

The first zinc-finger nucleases (ZFNs) were gener¬ 
ated based on the DNA sequence recognition func¬ 
tion of the zinc-finger transcription factors. Like all 
transcription factors, zinc-finger proteins possess 
a sequence-specific DNA-binding activity. An ini¬ 
tial proof-of-concept study in 1994 was aimed at the 
generation of a chimeric protein by hybridizing the 
zinc-finger domain with a nuclease to enable gen¬ 
eration of sequence specific DSBs [33]. Since then, 
the basic structural concept of ZFNs is laid on two 
separate domains, the DNA-binding and the DNA 
cleavage domains (Figure 2). The cleavage domain 
is primarily a nuclease without any specificity for a 
target sequence. Directing this domain to the spe¬ 
cific locus is the function of the DNA-binding do¬ 
main, or the zinc-finger domain. The DNA-binding 
domain of a ZFN contains Cys2His2 zinc-fin¬ 
ger structures (ZFs) in each unit. Here, an atomic 
zinc molecule is surrounded by approximately thir¬ 
ty amino acids. A single zinc-finger typically rec¬ 
ognizes 3 bp of DNA. The DNA-binding domains 
of ZFNs consists of three to six individual zinc-fin¬ 
ger repeats enabling 9 to 18 bp DNA sequences to 
be identified. Thus, zinc-finger domains can be en¬ 
gineered to target specific sequences up to 18 bp. 
This is generally accepted to be sufficient to target 
a single locus in a mammalian genome. The nucle¬ 
ase domain of the ZFNs is generally derived from 
the Type II restriction endonuclease Fokl. Type II 
restriction endonucleases are only active in a di¬ 
merized conformation. Thus, to achieve functional¬ 
ity, two identical domains need to dimerize around 
one target DNA. Considering the chimeric struc¬ 
ture of an engineered ZFN, two separate zinc-fin¬ 
ger domains need to recognize, and bind two oppo¬ 
site DNA strands of the target site for the two cleav¬ 
age domains to surround and successfully dimerize 
around the DNA helix (Figure 2). This structure re¬ 
quires that the functionality to be reassured only 
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Figure 2. Basic mechanism of action of ZFNs is illustrated. ZFNs are made up of various tandem zinc finger (ZF) mo¬ 
tifs (A). Each motif consists of an alpha helix (designated as the cylinder) as well as two beta sheets where an atomic 
zinc molecule is incorporated (designated as a star). One ZF motif can recognize and bind to 3 consecutive nucleotides 
on the single strand of a DNA helix. Structurally, a functional ZFN is a pair of tandem ZFNs targeting the two oppo¬ 
site strands of a DNA locus. Successive binding of the ZFNs on the two strands enables the functional pairing of the 
FokI nuclease to create a DSB (B). The optimization of the length of the linker sequence between the FokI nuclease do¬ 
main and the tandem ZF motives is critical. 


by the specific binding of the two separate ZF do¬ 
mains recognizing 18 bases, each of which are sep¬ 
arated by 5 to 6 bp spacers. The two 18 bp inverted 
target recognition sequences reassure the specifici¬ 
ty of this application [34]. 

ZFNs have been extensively used for engineer¬ 
ing applications in the genomes of many plants and 
animals, including tobacco, flies, worms, zebrafish, 
mice, rats and numerous mammalian cell lines. Like 
other genome editing tools, ZFNs were also used for 
the generation of genetic disease models also known 
as “isogenetic human disease models” (see the glos¬ 
sary box in table 1) [35]. ZFNs were also evaluated 
as a potential treatment option for the HIV infec¬ 
tion. The HIV virus requires the expression of co-re- 
ceptors C-C chemokine receptor type 5 (CCR5) for 
adhesion onto the T-cells. Thus, CCR5 is a prom¬ 
ising target for the control of HIV entry into the 
host cell. Humans who are homozygous for a par¬ 
ticular variation in the CCR5 gene are naturally re¬ 
sistant to the HIV infection by the blockade of en¬ 
try of the virus into the T-cells (the A32 variation). 
Additionally, in clinical use the transplantation of 
ex vivo expanded CCR5 (-/-) primary human CD4-t 
T-cells to HIV patients was found to be beneficial. 
In the light of this evidence, ZFNs were also test¬ 
ed as a potential option for the treatment of HIV in¬ 
fection. The primary aim of this was to disrupt the 
CCR5 in both primary CD4-i- human T-cells and hu¬ 
man hematopoietic stem cells. A Phase I clinical tri¬ 
al evaluating this approach was recently completed 
[36, 37]. Two separate Phase I clinical trials are still 


ongoing, in which T-cells are isolated from patients, 
treated with ZFNs to block CCR5 expression to pro¬ 
vide an HIV-resistant reservoir of CD4-i- T-cells and 
re-infused into the patients (NCT0I044654 and 
NCT00842634) [31]. Once the targeted efficacy can 
be achieved, a potential treatment option for AIDS 
patients may be provided by ex vivo modification of 
their hematopoietic stem cells using CCR5-specific 
ZFNs. Reconstitution of the patients’ immune sys¬ 
tems using stem cells with an engineered “CCR5- 
negative genome” may render their T-cells immune 
to HIV infections. 

TALE Nucleases 

TALE nucleases are principally derived from the 
Xanthomonas bacteria, which are plant pathogens. 
Xanthomonas use a cocktail of thirty to forty dif¬ 
ferent effector proteins during the infection of the 
host to damage eukaryotic cellular defense pathways 
[38]. The acronym TALE stands for Transcription 
Activator-Like (TAL) Effector and describes a fam¬ 
ily of proteins that can bind to specific genes in the 
plant genome to regulate their expression [39]. TALs 
contain 33 to 35 repetitive amino acid residues in 
a central domain, and the amino acid sequences of 
each repeat are almost identical. However, two con¬ 
tiguous amino acids at the 12* and 13* residues in 
the repetitive region serve a particular function in 
DNA-binding specificity [40]. These two hyper¬ 
variable amino acids, known as the repeat-variable 
di-residues (RVDs), determine the specificity of the 
TALE proteins [32]. The cracking of the RVD code 
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Figure 3. TAL-Like Effector Nucleases (TALENs) are targeted to a specific DNA sequence with the help of their re- 
peat-variable di-residues (RVDs). Here, the consecutive pair of aminoacids shows specificity for corresponding nucle¬ 
otides. These are NI (Asn, He) for adenine, HD (His, Asp) for cytosine, NN (Asn, Asn) or NK (Asn, Lys) for guanine 
and NG (Asn, Gly) for thymine. Each one of the two engineered TALE domains recognizes 18 - 20 bases on opposite 
strands of a DNA helix. This targeted binding functionally provides a specific conformation to the chimeric Fold nu¬ 
clease domain that creates a specific DSB at the target locus. Optimization of the linker sequence between the FokI 
nuclease and the TALE motives is critical. 


enabled the generation of engineered TAL effector 
proteins that may target any desired sequence in the 
genome and provide specificity to the FokI restric¬ 
tion enzyme to create a DSB at the target locus [41]. 
The DNA binding specificity of the RVD residues 
are coded by specific aminoacids that correspond to 
A, C, G and T nucleotides. These are NI (Asn, He) for 
adenine, HD (His, Asp) for cytosine, NN (Asn, Asn) 
or NK (Asn, Lys) for guanine and NG (Asn, Gly) for 
thymine. Like the ZFNs, for genome editing applica¬ 
tions, TALENs are used as sequence-specific DNA- 
binding domains engineered to direct the nonspe¬ 
cific DNA cleavage domains (nuclease) to the tar¬ 
get locus. Similarly, FokI endonuclease can be en¬ 
gineered to form a complex with TALE proteins to 
create a DSB at the target sequence. Again, like the 
ZFNs, FokI needs to dimerize to effectively create 
a DSB. Thus, two adjacent TALE binding sites at 
opposite strands that are separated by a spacer se¬ 
quence of 12-20 bp are required to create a targeted 
cleavage (Figure 3). 

CRISPR/Cas9 System 

The latest achievement in the field of genome en¬ 
gineering is the “Glustered, Regularly-Interspaced, 
Short Palindromic Repeats (GRISPR)-Associated 
protein system (/Gas). CRISPR’s fundamentals He in 
prokaryotes in the provision of adaptive immunity 
against viruses and plasmids [42]. The S. pyogenes 
SF370 Type II GRISPR locus contains four genes: 
Cas9 nuclease; two noncoding GRISPR RNAs (crR- 
NAs); trans-activating crRNA (tracrRNA) and a 


precursor crRNA (pre-crRNA) and nuclease guide 
sequences (spacers) interspaced by identical di¬ 
rect repeats [43]. The primary function of the RNA 
components is to direct the Gas9 endonuclease to 
the specific DNA sequences in order to generate 
DSBs in these foreign genetic sequences (thus pro¬ 
tecting prokaryotic host cells from infection). This 
adaptive immune system is essential to bacteria 
and archaea for protection against invading organ¬ 
isms. GRISPRs were initially identified in the E. coli 
genome in 1987. It was found that E. coli genome 
contains a series of 29 nucleotide repeats that are 
separated by unique 32 nucleotide spacer sequenc¬ 
es. This repeat-spacer-repeat pattern was later ob¬ 
served in diverse bacterial and archaeal genomes 
as well. Further studies reveal that these spacer se¬ 
quences were complementary to various viral and 
plasmid DNA sequences. This proved that GRISPR 
was an important adaptive immune element in pro¬ 
karyotes which also serves as a genetic memory of 
previous infections [44, 45]. 

As mentioned above, the S. pyogenes Type II 
GRISPR system has two varieties of essential com¬ 
ponents. Out of these, RNA components (crRNA 
and tracrRNA) merge to form the complex that pro¬ 
vides the Cas9 protein with the sequence specific¬ 
ity to cleave the target sequence (which is comple¬ 
mentary to the crRNA) [46]. Last year, this natu¬ 
ral two-component system was adapted and intro¬ 
duced to eukaryotic organisms such as yeast, plants, 
and mammals. This novel application was recog¬ 
nized as an easy and effective method of genome 
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Figure 4. The CrispR/Cas9 system targets the DNA sequence through specific RNA - DNA hybrid helix binding. The 
RNA element of the system exerts a dual action. Primarily the RNA pairs and binds to the specific locus (through the 
consensus sequence of G-N20-GG). Once bound to the target site, it also recruits the Cas9 nuclease protein via the 
structural hairpin loop (labelled with an asterisk). The Cas9 nuclease selectively cleaves both strands at the PAM site 
(designated as -NGG sequence). 


engineering in eukaryotes and enabled efficient ge¬ 
nome editing at any target DNA sequence in the 
mammalian genome. In 2013, Mali et al. modified 
the protein and RNA components of bacterial Type 
II CRISPR/Cas system to human cells [47]. Firstly, 
a human codon-optimized form of the Cas9 pro¬ 
tein containing a C-terminal SV40 nuclear localiza¬ 
tion signal was cloned into a mammalian expression 
system. The fusion transcript of the RNA compo¬ 
nents (guide RNAs) was then expressed by the hu¬ 
man U6 polymerase III promoter. The guide RNAs 
are not only necessary but also enable the Cas9 pro¬ 
tein to gain sufficient specificity to cleave the tar¬ 
get sequence. Mammalian U6 polymerase III pro¬ 
moter transcript starts with a G nucleotide which 
is required for the PAM (protospacer-adjacentmo- 
tif) sequence. The guide RNAs identify this PAM 
sequence (-NGG) at the target site that harbors the 
consensus motif of “G-N20-GG” and cut both DNA 
strands (Figure 4) [47]. Further studies confirm that 
the Cas9 endonuclease can be directed with a single 
RNA molecule and provide RNA-programmable ge¬ 
nome targeting and editing at any desired sequence 
[46]. Additionally, a single CRISPR system enables 
simultaneous modification of various sites within 
the mammalian genome. However, efficiency of the 
RNA-guided nuclease largely depends on the target¬ 
ed site and the cell type [48]. 


Implementations for the Human Genetic 
Diseases 

According to the Human Genome Mutation 
Database, nearly 80% of genetic diseases are caused 
by single base mutations, substitutions, small de¬ 
letions, and insertions. Modifying these mutated 
DNA sequences offers therapeutic benefits for sev¬ 
eral genetic conditions. Genome editing tools are 
thus the new hope for future gene correction ther¬ 
apy applications [49]. In order to achieve a precise 
correction of a mutation or to regulate the expres¬ 
sion of desired genes, ZFNs, TALENs and CRISPR/ 
Gas systems are powerful molecular tools that ac¬ 
curately recognize their target within the 3.2 billion 
base pairs of the human genome [50]. This prop¬ 
erty of genome editing tools creates a new oppor¬ 
tunity to cure hundreds of monogenic diseases. 
The most recent examples of gene correction ap¬ 
plications show that both ZFN and TALEN strat¬ 
egies can be used to correct genetic mutation that 
is responsible for sickle cell anemia. The mutation 
in the human |3-globin (HBB) gene could be cor¬ 
rected in patient-derived induced pluripotent stem 
cells (hiPSCs) [51, 52]. Similarly, disease-causing 
mutations in the Parkinson’s disease-associated al- 
pha-synuclein (SNCA) gene were also corrected in 
hiPS cells that have been generated from fibroblasts 
using ZFNs [53]. In 2011, the biallelic correction of 
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a point mutation (Glu342Lys) in the al-antitryp- 
sin (AlAT) gene causing al-antitrypsin deficien¬ 
cy was achieved [54], It has also been shown that 
ZFNs can efficiently correct the X-SCID muta¬ 
tion in the IL2Rr gene locus in both transformed 
human cells and primary T-cells [55]. These stud¬ 
ies are also important for autologous transplanta¬ 
tion strategies in which patient-derived cells might 
be corrected ex vivo and then reintroduced into do¬ 
nors, with reduced risk of immune response [56]. 
Moreover, it was shown that targeted genome ed¬ 
iting could achieve reading frame correction in the 
dystrophin gene and restore the dystrophin expres¬ 
sion in the cells from DMD patients, including skel¬ 
etal myoblasts and dermal fibroblasts [57]. Genome 
editing with the CRISPR/Cas9 system was utilized 
to disrupt the Pcsk9 gene in vivo with high efficien¬ 
cy, and off-target mutagenesis was not detected in 
ten selected sites at the genome. Pcsk9 inactivation 
has aided in the reduction of the low-density lipo¬ 
protein cholesterol (LDL-C) levels in mice. This ap¬ 
proach thus provides remarkable protection against 
susceptibility for cardiovascular diseases, with a fu¬ 
ture therapeutic potential in humans [58]. Yet an¬ 
other study shows that mice carrying dominant 
mutations in the cataract-causing Crygc gene could 
be corrected by the CRISPR/Cas system. Here, the 
gene correction was achieved via the HR mecha¬ 
nism, with very rare evidence of off-target muta¬ 
tions. Further studies reveal that the hereditary cor¬ 
rection could be achieved at the target locus and re¬ 
sulting mice could transmit the corrected allele to 
their progeny [59]. Lastly, the CRISPR/Cas9 genome 
editing system was successfully used to correct the 
CFTR locus by HR in cultured intestinal stem cells 
of Cystic Fibrosis patients. The corrected allele was 
expressed and shown to be functional in expanded 
organoids. This study also proves that targeted gene 
correction is possible by HR in primary adult stem 
cells (somatic stem cells) derived from patients with 
hereditary single gene defects [60]. 

In addition to the above-mentioned examples of 
gene correction, genes may also be disrupted by ge¬ 
nome editing to generate therapeutic phenotypes. 
For example, the HIV co-receptor CCR5 was dis¬ 
rupted in T-cells to block HIV entry [36]. CCR5 was 
also previously targeted in human hematopoietic 
stem cells by zinc-finger nucleases to limit HIV in¬ 
fection [37]. This CCR5 targeting approach is cur¬ 
rently in Phase 2 clinical trial for the treatment of 
HIV/AIDS as a cell-based therapy. Another study 


targets the glucocorticoid receptor gene by ZFNs 
in CD8+ cytotoxic T lymphocytes (CTL) as part of 
a T cell based cancer immunotherapy. These mod¬ 
ified T-cells have the ability to destroy glioblasto¬ 
ma tumor cells in animals in the presence of glu¬ 
cocorticoids. This initiation is currently in Phase 
1 clinical trial for the evaluation of safety and tol¬ 
erability of engineered T-cells (NCT01082926) [61, 
62]. A recent study also shows that TALENs pro¬ 
vide a potential for the treatment of chronic hep¬ 
atitis B (HBV) infection [63]. The functionality of 
the episomal covalently-closed circular HBV DNA 
(cccDNA) and suppressed viral replication mark¬ 
ers could be destroyed in both cultured cells and in 
vivo. This is a considerable advance in the therapeu¬ 
tic application of TALENs. Gene inactivation strat¬ 
egies can also be used to generate disease models in 
model organisms. TALENs have been used to in¬ 
activate the gene encoding low-density lipoprotein 
(LDL) receptor in pigs, thereby generating a mod¬ 
el for familial hypercholesterolemia [64]. Rats with 
X-Linked Severe Combined Immunodeficiency 
(XSCID) were generated with ZFNs targeting the 
rat interleukin 2 receptor gamma (I12rg) locus [65]. 
Moreover, the renin gene was targeted via ZFNs to 
create renin knockout rats for cardiovascular dis¬ 
ease models [66]. Recent studies demonstrate that 
CRISPR/Cas systems can efficiently and simultane¬ 
ously generate single- and multiple-gene mutations 
in rats [67]. 

Further studies show that, in addition to sin¬ 
gle gene disorders, other complex genetic diseas¬ 
es, such as cancer, can potentially be treated by 
the activation of target genes. For example, the tu¬ 
mor suppressor maspin (SERPINB5) is epigeneti- 
cally silenced in several types of epithelial tumors. 
Reactivation of maspin with zinc-finger transcrip¬ 
tion factors prevents tumor cell metastasis in vitro 
and tumor xenograft growth in vivo [68]. In addition 
to the activation of tumor suppressors, oncogene in¬ 
activation could also be accomplished via ZFN tech¬ 
nology. Falke et al. showed that activation of the ex¬ 
pression of the proapoptotic factor Bax by zinc-fin¬ 
ger transcription factors induces cell death selective¬ 
ly in cancer cells through the inactivation of the tu¬ 
mor suppressor p53 [69]. Lastly, BAK and BAX dele¬ 
tion by zinc-finger nucleases generated apoptosis-re¬ 
sistant mammalian cell lines (CHO cells) for the 
production of improved biopharmaceutical com¬ 
pounds [70]. Large scale and high-throughput gene 
disruptions via the CRISPR/Cas system can greatly 
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facilitate target identification and drug discovery 
studies for pharmaceutical uses [31]. 

A Comparative Analysis of Genome Editing 
Technologies 

The above-summarized examples show that all 
three genome editing technologies (ZFN, TALEN, 
and CRISPR/Cas system) can be used for gene dis¬ 
ruption, gene correction, and genome engineering 
in a variety of host cells and organisms, including 
mammals and humans. However, the efficiency and 
specificity of genome targeting depends on the lo¬ 
cus targeted, the cell type utilized and the proper¬ 
ties of the model organism chosen [31]. Genome¬ 
wide analysis of ZFN specificity reveals that a low 
but reckonable rate of off-target effect is present 
[71]. It has also been shown that ZFNs contain pro¬ 
found cytotoxicity and off-target effects compared 
to TALENs [72]. In one comparative study, TALENs 
were designed to disrupt the human CCR5 receptor 
locus, and successful editing was achieved in up to 
45% of transfected cells. A similar gene disruption 
activity was also achieved by ZFNs in a side-by-side 
comparison. TALENs, however, exhibited a signifi¬ 
cantly reduced cytotoxicity. Moreover, the CCR5- 
specific TALENs revealed minimal off-target activ¬ 
ity at the CCR2 locus compared to the correspond¬ 
ing ZFNs (CCR5 and CCR2 exhibit high sequence 
homology). Principally higher cytotoxic activity and 
off-target effects as well as the complex design pro¬ 
cess of ZFNs are the major drawbacks of this system. 
The labor-intensive requirements to reengineer and 
confirm the activity of the two opposite strand ZF 
motifs as well as optimizing the linker spacers ren¬ 
der ZFNs relatively difficult to optimize. Thus, de¬ 
spite ongoing clinical trials and the positive progress 
of TALENs, the final “boom” of the CRISPR/Cas9 
system surpassed the ZFNs for targeted modifica¬ 
tions of complex genomes. 

TALEN technology provides opportunities for 
targeted gene replacement or mutations at the ge¬ 
nomic sequence of interest. Compared to ZNFs, 
TALENs provide higher targeting flexibility, effi¬ 
ciency and simplicity. The frequent off-target cleav¬ 
age is the primary cause of the cytotoxic effects of 
the ZFNs. It is due to the fact that TALENs exhib¬ 
it higher sensitivity to mismatches that they do not 
harbor any such drawbacks. Moreover, studies show 
that three to four mismatches in the target recogni¬ 
tion site typically block the DNA-binding capacity of 
the TAL effectors [73]. 


The final achievement in genome editing is the 
CRISPR/Cas system. Several studies show a much 
higher target specificity of the CRISPR/Cas system 
compared to both ZFNs and TALENs. Moreover, 
adverse off-target risks of the RNA-guided mecha¬ 
nism are much lower than those of other genome 
editing technologies. Cong et al. showed that a sin¬ 
gle-base mismatch up to II bp 5’ of the PAM com¬ 
pletely blocks the action of Cas9 nuclease. However, 
mutations further upstream of the spacer may not 
diminish the nuclease activity [48]. In contrast to 
ZFPs and TALENs, which require the design and 
verification of a new engineered protein for each tar¬ 
get, the CRISPR system only needs to change the 20 
base pair protospacer sequence within the CRISPR 
expression vector to target a new site. Thus, RNA- 
programmed Cas9 offers an easy opportunity for 
gene targeting and genome editing applications [46]. 
The specificity of RNA-guided endonucleases can 
be established relatively easily by the design of guide 
RNA molecules complementary to the target se¬ 
quence so as to achieve Cas9-mediated DSBs at the 
desired locus. 

One other diversion of the CRISPR/Cas system 
from ZFNs or TALENs is the mechanism of nucle¬ 
ase action that drives DSBs. In CRISPR/Cas, DSBs 
are generated through the Cas9 nuclease with¬ 
out any need for the generation or optimization of 
a chimeric FokI restriction endonuclease. This is 
one of the advantages of the CRISPR/Cas system. 
Following the generation of DSBs at the genome, the 
common endogenous repair mechanisms of the tar¬ 
get cell are activated. The choice of repair process 
is independent of the genome editing tool used for 
the induction of DSBs. Either an error-prone NHEJ 
is activated or, as mentioned above, HR may be used 
to introduce any exogenous homologous sequence 
within the target locus in the presence of a similar 
DNA sequence. Consequently, it has been shown 
that by the use of the CRISPR/Cas system, efficient 
and simple genome editing is possible. Further im¬ 
provements to the CRISPR/Cas system are also re¬ 
quired to increase its efficiency. Some limitations of 
this system include the requirement of the consen¬ 
sus motifs for the Cas nuclease activity or the attain¬ 
ability of the target site due to chromatin and DNA 
methylation states. However, other diverse mem¬ 
bers of the Cas9 family with various PAM sequenc¬ 
es can be readily introduced to overcome these ob¬ 
stacles [48]. Moreover, in a recent study, Veres et 
al. showed off-target mutagenic effects of these 
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systems to be extremely rare [74], By using CRISPR/ 
Cas and TALEN-targeted human pluripotent stem 
cell clones, they performed whole-genome sequenc¬ 
ing at high coverage to determine the degree of mu¬ 
tagenesis in the entire genome. This genome-wide 
analysis study shows that although off-target muta¬ 
tions may be a risk for some cell types, the probabil¬ 
ity of such effects in human pluripotent stem cells 
may be adequately low and not a significant concern 
for disease modeling and similar applications. 

Outlook 

The examples summarized above show that genome 
editing tools are the next novel therapeutic tools 
for the advancement of human health. Besides their 
current widely accepted use for the creation of cell- 
or animal-based genetic models, further clinical tri¬ 
als are also impending, targeting several single gene 
disorders. Although future applications towards the 
cure of infectious diseases and cancer have been 
initiated, we are still far from any efforts target¬ 
ing common diseases. The major reason for this is 
the fact that a multitude of alleles with low pene¬ 
trance but relatively high frequency are the princi¬ 
pal causes of these pathologies. Our current level of 
knowledge is evidently still far from understanding 
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the genetic fundamentals of these multi-component 
variations. A careful look at the clinical trials data¬ 
base deduces that the majority of the on-going trials 
are cancer gene therapy trials. Somatic cell therapy 
targeting cancer cells is a major challenge in itself, 
with multiple hurdles. Finding voluntary patients to 
be enrolled in these trials, however, is not an obsta¬ 
cle. On the other hand, our current understanding 
of the risks of the off-target effects of the genome 
editing tools is a major drawback for any implemen¬ 
tations on children with single gene disorders. The 
unfortunate consequences of previous trials with 
retroviruses necessitate further precautions to be 
taken with such approaches. 

At the forefront of this new genome engineer¬ 
ing revolution, the authors have initiated the prima¬ 
ry research project at Hacettepe University in 2013 
using the CRISPR/Cas system to introduce targeted 
mutations in stem cell lines. Once proven to be ben¬ 
eficial, this approach can be implemented on embry¬ 
onic stem cells for the creation of animal models as 
well. The current perspective on genome engineer¬ 
ing deduces that the future directions are likely to 
provide specific somatic gene correction therapy op¬ 
portunities for several conditions that are currently 
accepted as ’’intractable” for clinicians. 
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